Compiling Regular Computations to Fine-Grained Linear Processor Arrays

Author

  • Darren C. Cronquist
Abstract

Fine-grained linear processor arrays are an important class of architectures for obtaining high performance on computationally intensive algorithms with large data sets, as commonly found in digital signal processing and scientific computing. The vast number of processing elements on these architectures provides an immense amount of potential parallelism, but at the price of limited interconnect and fine granularity, resulting in complex constraints on synthesis. To overcome such restrictions, compilers must search for both iteration-level and instruction-level parallelism. This paper evaluates the iteration-level scheduling techniques of space-time mapping and loop transformation theory, and the instruction-level techniques of software pipelining and trace scheduling. In particular, the applicability of each technique to linear fine-grained processor arrays is discussed. The conclusion is that a combination of loop transformation theory, software pipelining, and space-time mapping provides a potentially powerful method for scheduling to this class of architectures.
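As a rough illustration of what space-time mapping means for a linear array, the sketch below assigns each iteration (i, j) of a matrix-vector loop nest to a processor and a time step. It is not taken from the paper: the allocation p = i, the schedule t = i + j, and the names `space_time_map`, `is_valid_mapping`, and `matvec_on_array` are all assumptions chosen for this example.

```python
from itertools import product

def space_time_map(i, j):
    """Hypothetical mapping: iteration (i, j) -> (processor, time step)."""
    return i, i + j  # assumed allocation p = i and schedule t = i + j

def is_valid_mapping(n, m, deps):
    """Check the assumed mapping on an n x m iteration space."""
    slots = set()
    for i, j in product(range(n), range(m)):
        slot = space_time_map(i, j)
        if slot in slots:  # two iterations on one processor at the same time
            return False
        slots.add(slot)
    for di, dj in deps:
        # The mapping is linear, so a dependence vector maps the same way.
        dp, dt = space_time_map(di, dj)
        # Data must be produced before use and can move one hop per step.
        if dt < 1 or abs(dp) > dt:
            return False
    return True

def matvec_on_array(A, x):
    """Compute y = A x by visiting iterations in schedule (time) order."""
    n, m = len(A), len(x)
    y = [0] * n
    order = sorted(product(range(n), range(m)),
                   key=lambda it: space_time_map(*it)[1])
    for i, j in order:
        y[i] += A[i][j] * x[j]  # work done by processor i at time i + j
    return y
```

Under these assumptions, the accumulation dependence (0, 1) stays on one processor and the propagation of x along (1, 0) moves one hop per time step, which is the nearest-neighbour pattern a linear array supports.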

Similar resources

Mapping Applications to Coarse-Grain Reconfigurable Architectures

Coarse-grained reconfigurable architectures (CGRAs) are capable of achieving both goals of high performance and flexibility. CGRAs not only improve performance by exploiting the features of repetitive computations, but also can adapt to diverse computations by dynamically changing configurations of an array of its internal processing elements (PEs) and their interconnections. Many CGRAs have be...

Energy-Efficient Sorting on a Many-Core Platform

As processors move from multi-core to many-core architectures, opportunities arise for energy-efficient enterprise computations, such as sorting, on large arrays of processors. This paper proposes three different energy-efficient sorting methods for the first phase of an external sort simulated on varying-sized fine-grained many-core processor arrays used as a co-processor to an Intel CPU, wh...

Fast Synthesis of Large Planar Arrays Using Active Element Pattern Method and Fine-Grained Parallel Micro-genetic Algorithm

A radiation pattern synthesis technique for large planar arrays with active element pattern (AEP) method and fine-grained parallel micro-genetic algorithm (FGPMGA) is presented. Based on the AEP method, the mutual coupling between array elements can be taken into account. Analysis problems of large rectangular and triangular grid planar arrays are divided into small linear array problems. And f...

Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines

Array statements are often used to express data-parallelism in scientific languages such as Fortran 90 and High Performance Fortran. In compiling array statements for a distributed-memory machine, efficient generation of communication sets and local index sets is important. We show that for arrays distributed block-cyclically on multiple processors, the local memory access sequence and communicati...

How to Emulate Fine-grained Multithreading

Fine-grained multithreading can be used to hide long-latency operations encountered in parallel computers during remote memory access. Instead of using special processor hardware, the emulation of fine-grained multithreading on standard processor hardware is investigated. While emulation of coarse-grained multithreading is common in modern operating systems, in the fine-grained case research on ...

Publication date: 2007